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Abstract 

In the first part of this paper we have derived achievable transmission rates for the (single-band) interference 
relay channel (IRC) when the relay implements either the amplify-and-forward, decode-and-forward or estimate-and- 
forward protocol. Here, we consider wireless networks that can be modeled by a multi-band IRC. We tackle the 
existence issue of Nash equilibria (NE) in these networks where each information source is assumed to selfishly 
allocate its power between the available bands in order to maximize its individual transmission rate. Interestingly, it 
is possible to show that the three power allocation (PA) games (corresponding to the three protocols assumed) under 
investigation are concave, which guarantees the existence of a pure NE after Rosen fS). Then, as the relay can also 
optimize several parameters e.g., its position and transmit power, it is further considered as the leader of a Stackelberg 
game where the information sources are the followers. Our theoretical analysis is illustrated by simulations giving 
more insights on the addressed issues. 

Index Terms 

Cognitive radio, game theory, interference relay channel, Nash equihbrium, power allocation games, relay channel, 
Stackelberg game. 



I. Introduction 

A possible way to improve the perfomiance of a network comprising pairs of source-destination links, 
interfering with each other, is to add some relaying nodes in the network. In the first part of this two- 
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part paper, we have studied the special case of a network composed of two source nodes, each of them 
sending private messages to its respective destination with the help of a common relay node. The emphasis 
was on the relaying protocol and determination of achievable transmission rates. In this second part paper, 
we assume that there are several relaying nodes operating in non-overlapping frequency bands and that the 
source nodes have to allocate their powers between the available frequency bands in order to maximize their 
transmission rates. Therefore, the focus in this part is on the decentralized power allocation (PA) problem. 
One of our motivations for analyzing such a scenario is to acquire a better understanding of a system 
where two cognitive transmitters, each of them communicating with its respective receiver, are offered the 
opportunity to re-exploit frequency bands (unused locally in space or time) in which a relaying node is 
available in order to further increase the performance of the system. As the transmitters are assumed to be 
decision makers that can freely choose their own resource allocation policies while selfishly maximizing 
their transmission rate and as the communications interfere in each band, the resource allocation problem 
can be modeled by a non-cooperative game. Then, several key questions arise: What is the influence of the 
relaying protocol on the game formulation? Is there a predictable state (equilibrium) at which the system 
will effectively operate? 

Before describing the structure of this paper, presenting the system model and tackling these issues, we 
will mention the closest contributions to those presented in this paper. As mentioned in the Abstract, the 
network under consideration can be modeled as a set of parallel (or orthogonal) interference relay channels 
(IRCs). In Part I, we have mentioned that this channel has been introduced by HlISl. In these works, the 
problem is not formulated as a game and, therefore, no equilibrium analysis is conducted. Additionally, the 
authors study in detail only the case of the decode-and-forward (DF) protocol while, here, we exploit the 
results derived in Part I for the three dominant classes of protocols i.e., amplify-and-forward (AF), DF and 
also estimate-and-forward (EF). The closest work concerning the game-theoretic approach we adopt is the 
work by Yu et al. I6J. In [[61, the authors studied the distributed PA game in frequency selective interference 
channels. The problem we present here is a generalization of the problem addressed by Yu et al. for which 
a relay (implementing a certain protocol) is available on each band or sub-carrier. 

This paper is structured as follows. Sec. HI] describes the system model. In Sec. Hill |IV] and |Vl we analyze 
the existence of a Nash equilibrium (NE) in the non-cooperative PA game when zero-delay scalar AF 
(ZDSAF), DF and EF protocols are respectively considered. Sec. UlI] is itself divided into two sub-sections, 
one dedicated to the general ZDSAF protocol and the other to a special case where the amplification gains 
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at the relay nodes are independent of the power of the signals they receive. Sec. |IV] also comprises two 
special cases, one case where the game is due to the power allocation between the bands and another one 
where it is due to the power allocation between the fine and coarse messages in the DF protocol (following 
the comments made in Part I). Then, in Sec. |VIl we formulate the PA problem as a Stackelberg game 
by introducing the relay as an additional player of a hierarchical game in order to know how the relay 
parameters should be tuned in practice. Sec. I VIII provides, in particular, simulations showing the importance 
of choosing the amplification factor properly for the ZDSAF protocol and optimally locating the relay. 
Summarizing remarks and possible extensions are given in Sec. IVIIII 

II. System Model 

The system under investigation comprises two source nodes Si, S2, transmitting their private messages to 
their respective destination nodes Vi, V2. To this end, each source can exploit Q non-overlapping frequency 
bands (the notation (q) will be used to refer to band q E {1, . . . , Q}) which are assumed to be unitary. The 
signals transmitted by Si and ^2 in band (q), denoted by x{^^ and ^2''^ respectively, are assumed to be 
independent and subject to power constraints: 

Q 

VzG{l,2}, J2^\xl'^\'<Pi. (1) 

q=l 

For i G {1,2}, we denote by 6'-^^ the fraction of power that is used by Si for transmitting in band (q) 
that is E|xf^''P = Of^ Pi. On each band (q), a relaying node IZ'^'^^ is available. Each relaying node is 
assumed to operate in the full-duplex mode. The transmit power at each relay is also subject to a constraint 
E|Xr^^P < Pr'^\ Note that the relay transmit power in band (q) is subject to an individual constraint, which 
implicitly means that the relays are not co-located (operating in the same frequency band) Otherwise, for a 
single relay operating in Q bands, the power constraint would be Yl^=i'^\^r'^ \^ < Pr, which is not what 
we assume in this paper. With these notations, the baseband signal received by Vi, V2 and TZ^'^^ in band 
(q) express as: 

' yI'^ = h^^^xi'^ + h^^^X^"^ + h^^^X^'^ + z['^ 

< Yi'^ = /iSJxf^) + /igx^"^ + /liJx^^) + Z^^) (2) 

where Z^"'' ~ A/'(0, A^f'^''), i G {l,2,r}, represents the Gaussian complex noise on band (q) and, for all 
i,j G {l,2,r}, h\j^ is the channel gain between Si and Vj in band (q), with the convention /i^ = 0. 
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As justified in Part I, we consider a realistic situation where only large scale propagation effects can be 
taken into account by the users to optimize their rates. Thus the channel gains are considered to be static. 
Concerning channel state information (CSI), we will always assume coherent communications between each 
transmitter-receiver pair (Si,'Di) whereas, at the transmitters, the information assumptions will be context- 
depending and deduced from the latter. At Vi and V2, single-user decoding (SUD) will always be assumed. 
It is a realistic assumption in a framework where devices operate in unlicensed bands in an a priori non- 
coordinated manner. At the relays Tl^^\ q E {1,...,Q}, the reception scheme implemented will depend 
on the protocol assumed. The expression of the signals Xr''^ depend on the respective protocol and will 
therefore also be explicated in the corresponding sections. At last, we will use the same specific notations 
as in Part I: The capacity function for complex signals is denoted by C{x) = log2(l + x); for any real 
a E [0, 1], the quantity a will stand for a = 1 — a; the notation —i will mean that — z = 1 if z = 2 and 
-z = 2 if z = 1. 

in. Equilibrium analysis for the ZDSAF protocol 

In this section, we assume that the relay nodes implement the ZDSAF protocol, which has already been 
described in Part I. One of the nice features of the (analog) ZDSAF protocol is that relays are very easy to be 
deployed since they can be used without any change on the existing (non-cooperative) communication system. 
The amplification factor/gain for the relay TZ^''"' will be denoted by ai^\ We have discussed the optimal choice 
of this parameter in Part 1. In this part, we consider two choices for ai'^'' : (A) the case where it is chosen to 
saturate the transmit power constraint at 71^^'^ that is, ai'^'^ = , / '^'l!! „ = \ / („, „ — ,„) '^'l!! „ — ivi — ^ — (^^r^\ 
(B) the case where it is a constant w.r.t. 9^^'' and 6^2^ in which case it will be denoted by In this section, 
we prove the existence of an NE in the most general case, i.e, case (A). The reason for analyzing case (B) is 
at least twofold. First, in practice, it corresponds to the situation where the relay is an analog repeater, which 
is very easy to be implemented. The second motivation is technical. While the determination of the selfish 
PA policies for the different sources at the equilibrium is always possible numerically, case (B) allows one 
to analytically determine the number of Nash equilibria and corresponding policies. Before treating cases 
(A) and (B) in detail, we now describe the non-cooperative PA game with ZDSAF. Indeed, one of our 
goals is to know how each selfish transmitter is going to allocate its available power between the different 
bands, given the fact that the transmitters are able to observe each other and react accordingly. This situation 
of interaction can be modeled by a non-cooperative game where: (i) the players of the game are the two 
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information sources or transmitters; (ii) the strategy of transmitter i consists in choosing 6^ = {6l^\ . . . , Of^^) 
in its strategy set Ai = \ 9_i G [0, 1]*^ '^^^f^ — ''^ I ' ^^^^^ utility (or payoff) function of user i G {1,2} 



is its achievable transmission rate given by uf^ {6^,6_-) = R^'^^'^^ {9^''\6^fl) where r!^^'^^ is the rate 

9=1 

user i obtains by using band (q) when the ZDSAF protocol is used by the relay TZ'^^K The latter quantity 
has been shown to be (in Part I): 



where 

Mail) 



{g),AF 
Vi 



(4) 



with Vz G {1, 2}, j = -i and p:^ = in case (A), ai''^ = ai''\e'f\ei^^) and, in case (B), ai'^^ = aI'^K 
Without loss of generality and for sake of clarity we will assume in Sec. Hill that V(i,g) G {l,2,r} x 
{1, . . . , Q}, A'^'''^'' = A^, Pr'^'' = Pr and we introduce the quantities Pi = Additionally, we suppose 
that the game is played once (one-shot game), the users are rational (each selfish player does what is 
best for itself) and the game is with complete information that is, every player knows the triple = 
(/C, {Ai)i^jc, {uf^)i^ic), where /C = {1, 2} is used to refer to the set of players. From now on, we will call 
state of the network the vector of power fractions that the users allocate to the IRCs i.e., 9 = (^1,^2)- 
In distributed networks where users are selfish and free decision makers who interact with each other, a 
desirable feature for the network is the existence of an equilibrium or a stable operating state of the system. 
In this respect, the Nash equilibrium [|7]| corresponds to a state of the network from which the users do 
not have any incentive to deviate unilaterally, because otherwise they would lose in terms of utility; this 
translates mathematically by the following definition. 

Definition 3.1: [Nash equilibrium] The state {0i,O*_.i) is a pure NE of the game Q^^ if Wi G /C,V^^ G 

It turns out that the existence of such a stable state is guaranteed in both scenarios (cases (A) and (B)) 
under investigation. Proving this is the purpose of the following two sub-sections. 
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A. Case of full power regime 

In this section, af'^ = af\o'f\62'') Vg G {1, ....Q}. Under this assumption we can state the following 
existence theorem. 

Theorem 3.2: [Existence of an NE for ZDSAF when ai^^ = af\9^f', &2^)\ There exists at least one pure 
NE in the PA game . 

The proof of this theorem is provided in Appendix |Al Here, we will just give a few comments on it. It 
is based on Theorem 1 in |[3l. The latter theorem states that in a game with a finite number of players, 
if for every player (i) the strategy set is convex and compact, 2) its utility is continuous in the vector of 
strategies and 3) concave in its own strategy, then the existence of at least one pure NE is guaranteed. In 
our setup it is easy to check that conditions 1) and 2) are met. Verifying condition 3) for the utility Ui is 
however more involving. Indeed, it can be checked that the second-order derivative of R^f*'^^ w.r.t. 6^f^ is 
intractable. It turns out that proving that this second-order derivative is non-positive is possible if a proper 
change of variables is made before the sign analysis, as shown in the Appendix lAl 

The proved theorem indicates, in particular, that whatever the values of the channel gains hij, G 
{l,2,r}^, there exists an equilibrium. As a consequence, if some relays are added in the network, the 
transmitters will adapt their PA policies accordingly and, whatever the locations of the relays, the selfish 
behavior of the transmitters will drive the network to an equilibrium. This is a nice property for the system 
under investigation. The question to know which equilibrium is also an interesting issue to be treated. We 
will treat it partially in this paper and leave it as an extension of this work. In the next sub-section we 
analyze a useful and simple case where the equilibria can be determined. 

B. Case of fixed amplification gains 

In this section, we assume Q = 2 and that each transmitter saturates its power constraint that is, Vz G 
{1,2}, ^ + = 1. Thus, for the sake of clarity, we rename the power fractions in bands (1) and (2) as: 
9\^^ = 6*1 and 6*2^'' = $2 in the remaining of this section. More importantly, we suppose that Vg G {1,2}, 
ai'^^ = aI'''' G [0, ar{l, 1)], a fixed constant. This choice is interesting in terms of both, the simplicity of the 
relays (simple repeaters) and the information assumptions. It is true that choosing the amplification gain 
ttr = al?\6i,62), as above, allows the relay nodes to exploit all their available powers. However, it also 
assumes the presence of a mechanism to estimate the power of the received signals at these relay nodes. 
While this can be easy for a digital relay transceiver that knows the possible training sequences used by 
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the sources, it might be impossible if the relay is a simple analog power amplifier without automatic gain 
control. Furthermore, the case ai'^^ = Ai'^^ has a very interesting convergence property which allows to relax 
some information assumptions at the transmitters. To understand this and determine the Nash equilibria, 
consider the best responses (BRs) of the different players. The BR of player i to player j is defined by 
BRi{9j) = argmaxui{6i,6j). In general, it is a correspondence but in our case it is just a function. The 
equilibrium points precisely correspond to the intersection of the BRs of the two users. In this case, using 
the Lagrangian functions to impose the power constraint, it can be checked that: 

Fi{e,) if < F,{ej) < 1 

1 if F^iOj) > 1 (5) 
otherwise 

where j = -i, FiiOj) = -gj^i + J is an affine function of 6*^; for (ij) G {(1,2), (2, 1)}, c^i = 2\A'i^h„hir + 

^iil \Ar QriQir ~\~ 9ii\ Pi\ Cij = \Ar fl^ihir -\- hii\ \Ar Qridjr 9ji\ Pj \Ar hrihjj- -\- hji\ \Ar QriQir ~\~ 9ii\ Pj'-: 
di = \Ai ^ hrihij.-\-hii\'^[\Ai ^ gri9ir~^9ii\'^ Pi~^\^r ^ gri9jr~^ 9ji\'^ Pj ~^ "* Ifi^r-i P + 1] ~ | ^ 9ri9ir~^ 9ii\'^ i^r ^|/iriP + 

1). By studying the intersection points between BRi and BR2, one can prove the following theorem (the 
proof is provided in Appendix |B]). 

Theorem 3.3 (Number of Nash equilibria): For the game with fixed amplification gains at the relays, 



BRi{9j) 



(i.e., -^T-y = 0), there can be a unique NE, two NE, three NE or an infinite number of NE, depending on 

del" 

the channel gains hij, G {l,2,r}^. 

If the channel gains are thought of as the realizations of continuous random variables, it is easy to prove 
that the probability of observing the necessary conditions on the channel gains for having two NEs or an 
infinite number of NEs is zero. Said otherwise, if one puts the relays in arbitrary places, there will be, with 
probability one, either one or three NE, depending on the channel gains. When the channel gains are such 
that the NE is unique, the unique NE can be shown to be: 

5)NE _ Q* _ { ^22^1 - Ci2d2 Ciid2 - 021^1 \ 
VC11C22 - C12C21 ' C11C22 - C12C21/ 

When there are three NE, it is a priori impossible to predict the NE that will be effectively observed in 
the one-shot game. In fact, in practice, in a context of cognitive transmitters, it is possible to predict the 
equilibrium of the network. In practice, a multi-band IRC with cognitive transmitters will work as follows. 
First, there is no reason why the sources should start transmitting at the same time. Thus, one transmitter, 
say i, will be alone and using a certain RA. policy. The transmitter coming after, namely S^i, will sense 
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its environment and play its BR to what it observes. As a consequence, user i will move to a new policy, 
maximizing its BR to what transmitter —i has played. User i will then re-adapt its policy and so on. The 
key question is: does this procedure converge? This is where the nice property of the BRs come into play. 
It can be checked that this procedure will converge to one of the three NE predicted by the one-shot game. 
The observed NE can be predicted by knowing the initial network state that is, the FA policy played by 
the first player starting to transmit; this will be illustrated in Sec. IVIII In fact, even though the BRs are just 
piecewise affine and not affine (as in a Coumot duopoly |[8l), the game has the same convergence property 
as a Cournot duopoly. To implement such an iterative procedure (called the Coumot tatonnement process 
in BH), it can be checked that the transmitters need to know less network parameters than in the original 
game where the amplification factor saturates the constraint. In fact, the needed parameters can be acquired 
by realistic sensing techniques or feedback mechanisms based on standard estimation procedures, which is 
particularly easy in fast fading environment since the path loss can be considered to be constant during a 
period of time long enough to store a large number of channel realizations. 

IV. Equilibrium analysis for the DF protocol 

In this section, the relays are assumed to be able to reliably decode the messages sent by the two 
information sources. In Gaussian relay channels, the DF protocol can perform better than the AF and EE 
protocols in terms of transmission rate; this typically occurs when the relay is close to the source relatively 
to the destination The principle of the DF protocol is reviewed in Part I. Here we will just mention that, 
in the Gaussian relay channel, the source superposes two codewords corresponding to two messages i.e., a 
coarse message reliably decodable by the destination alone and a fine message that the destination reliably 
decodes with the help of the relay node. This explains the following structure for the signals transmitted 
by Si and S2 in the type of networks we consider in this paper: xl'^^ = xljj + J \) ^' (q) where the 
signals X-'(l and X^f are Gaussian and independent. The first signal xj^'^Q precisely corresponds to the coarse 
message while the second one corresponds to the fine message. Indeed, at the relay 7l^'^\ the transmitted 
signal writes as: Xr"^ = X^'^^ + X^2''. We see that the problem of resources allocation is more complex 
than in the case of AF. There are actually three power allocation problems instead of one: (1) like AF, each 
information source needs to allocate its power between the Q different bands; (2) in each band, each source 
needs to tune its cooperation degree t- '^^ with the relay that is, it has to allocate its power in band (q) between 
the coarse and fine signals; (3) each relay has to allocate its power between the two cooperation signals 
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intended for Vi and V2. The parameters ^-^^ r^''^ and z/^'^^ translate these three PA problems respectively: 
X^"^ ~ ^{0,91"^ Pi), Xfj ~ Ar(0, (1 - T^^'^VS'^^O and ~ Ar(0, P^^"^^). In this paper, the authors 
restrict their attention to two special cases of PA games and let the general case where the strategy of a 
user is {6_^,t_^ = {Ol^\ ...^9^^\Ti^\ ...,t/*^^) as an interesting and significant extension of this paper. First, 
we consider the PA game (1) by supposing the cooperation degrees and PA policies at the relays to be 
fixed (Sec. IIV-AI ). Second, we analyze the PA game (2) where only the cooperation degrees can be tuned, 
the other parameters being assumed to be fixed (Sec. IIV-BI) . The first game allows one to be coherent with 
the PA game with AF while the second game allows one to study the game introduced in Part I. In both 
sub-sections we assume that each relay implements SUD. This choice is fair for the transmitters, does not 
involve any additional signalling and allows us to cope with the constraint on the sum-rate on each multiple 
access channel formed by (Pi, 1)2) — 7^*^"^^ which would make the game more complex to be played and is 
therefore left as an extension of this paper. Also, in both sub-sections the utility of a transmitter will have 
the same expression that is, Vi G {1, 2}, nf^ = Yl^=i R^^'^^ where 



(9),DF 
2 



(7) 



D(g),DF ^ ^ / |^lr| (1-^1 j^l A 



p(g),DF _ ^ 



p(g),DF _ ^ 



I /i^'i' I ""ej"' Pi + 1 hi\^ pi"^ +2Rc (h[f hil^ • * ) y^peppM^)^ 

hit f (>['''> Pi^hlf pi/(9)P,(^'+2Rc(h(«'h(|''*) y^p^^)p;T^^)^+7v(«) 



(8) 



and (//(^), r{'^\ Tg'^-') G [0,1]^, and z/^"?) = 1 — iy^'i\ Note that we have not indicated the arguments of uf^ 
i.e., the strategies of the users, since these are context-dependent. In Sec. IIV-A[ the strategy of user i is 
e. = (^f \ eP) whereas in Sec. HV-Bl it is = (7;^^^ t^). 

A. Allocating the power between the bands 

Here we assume that the cooperation degrees (2:1,7:2) are fixed and the strategy for user i consists in 
choosing ^j. The PA game is similar to that encoutered with the AF protocol. Therefore, here again, the 
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problem for proving the existence of an NE in this game is to show the concavity of uf^ w.r.t. 9^^. In contrast 
with AF, this task can be checked to be very easy for the rates achieved by using DF. Having a concave 
game in the sense of Rosen [3J, the following existence theorem follows. 
Theorem 4.1: [Existence of an NE for the DF protocol] 

r Q 



q=l 



The game defined by Q^^ = (/C, (A)igK;, (^^?^(^i,^_^))^ec) with JC = {1, 2} and Ai = j^^ G [0, 1]^ 

has always at least one pure NE. 
While the existence is easy to prove, the uniqueness remains a though problem. For example, the diagonally 
strict concavity condition of [3], which is a sufficient condition for uniqueness for concave games, is not 
trivial to be checked. 



B. Allocating the power between the coarse and fine signals 

We now suppose that the power fractions 9^ and ^2 allocated to the Q bands are fixed (for example 
9^^ = ^ and 92 = ^, which corresponds to a uniform PA). We therefore obtain exactly the game we have 
highlighted in Part I (for the case Q = 1). Among the three dominant classes of protocols (AF, DF, EE) 
we see that the DF protocol is the only protocol that introduces a game in a natural manner that is, even if 
there is no multiple antenna transmitters, multi-carrier systems, multi-slot transmissions, etc. Fortunately, the 
game where each transmitter tunes their cooperation degree with the relays TZ^'^^ has at least one equilibrium, 
as stated in the following theorem. 

Theorem 4.2: [Existence of an NE for the DF protocol] 
The game defined by = (/C, {Ai)i^ic, {uf^{Ti,r_.i))i^ic) with K = {1, 2} and Ai = [0, 1]*^, has always 
at least one pure NE. 
The proof also follows from 

V. Equilibrium analysis for the EE protocol 

In this section, we make the same assumptions as in Sec. |IV] concerning the reception schemes and PA 
policies at the relays: we assume that each relay TZ^''\ Vi and V2 implement single-user decoding and the 
PA policy at each relay i.e., u'^'^^ is fixed. Each relay now implements the EE protocol. The principle of the 
EE protocol in the relay channel is to send an approximated version of the relay's observation signal to 
the receiver. In Part I, we have seen that, in IRCs, the EE protocol can have at least two natural variants 
depending on whether the relay constructs a unique estimate of the signal it receives that is decodable by 
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both receivers or two estimates, each of them being decodable by one or two receivers. We have called the 
two corresponding schemes the single-level compression scheme and bi-level compression scheme. Both 
because of the lack of space and for simplicity reasons, we treat here the case of EF with two compression 
levels. The single-level compression scheme implies a more complicated expression of the compression 
noise level that makes the concavity issue strenuous to deal with. Under this assumption, the utility for user 
i e {1,2} can be expressed as follows: wf^(^i,^_j) = Yl^=i R^^"^^ where 



(9) 



wz.l 







4!f' 




4«j)^^(9)p^'')+Ar('')' 










4!f' 


I'vi 


%(9)P^''' 











(10) 



olocoij I ne game aejinea oy y = 
Q ^ 

^^0^^^ — ^ r ^'^^ always at least 

9=1 J 



G [0, 1], A(^) = p^i'^^Pi + p^?^P2 + N^'\ 4'^ = h^hir'*Oi^Pi + h^}h!i^'*e^^P2 and 4'^^ = 
h^u^i^J'*^!^^ Pi + ^22^^2r^'*^2''^-P2- What is interesting with EF is that, here again, one can prove that the 
utility is concave for every user. This is the purpose of the following theorem. 

Theorem 5.1: [Existence of an NE for the bi-level compression EF protocol] The game defined by 

(/C, (A).ec, (tif^(^„^_J).e;c) with /C = {1, 2} and A, = G [0, 1]« 
one pure NE. 

To be able to apply Theorem of Rosen |[3l, we have to prove that the utility nf^ is concave w.r.t. 6_^. The 
problem is less simple than for DF because the compression noise A'^^^j ■, which appears in the denominator 
of the capacity function in Eq. also depends on the strategy 6_i of transmitter i. It turns out that it is 
still possible to prove the desired result as shown in Appendix O 

VL Stackelberg formulation 

We have mentioned that a strong motivation for studying IRCs is to be able to introduce relays in a 
network with non-coordinated and interfering pairs of terminals. For example, relays could be introduced 
by an operator aiming at improving the performance of the communications of his subscribers. In such a 
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scenario, the operator acts as a player and more precisely as a game leader in the sense of IfTTI . In IfTTI . the 
author introduced what is called nowadays a Stackelberg game. This type of hierarchical games comprises 
one leader who plays in the first step of the game and several players (the followers) who observe the leader's 
strategy and choose their actions accordingly. In our context, the game leader is the operator/engineer who 
chooses the parameters of the relays and the followers are the cognitive transmitters that adapt their PA 
policy to what they observe. What are these parameters? In the preceding sections we have mentioned some 
of them: 

1) the location of each relay; 

2) in the case of AF, the amplification gain of each relay; 

3) in the case of DF and EF, the power allocation policy between the two cooperative signals at each 
relay i.e., the parameter 

Therefore, the group of relays can be thought of as a player who maximizes its own utility. This utility 
can be the individual utility of a given transmitter (picture one WiFi subscriber who wants to increase his 
downlink throughput by locating his cellular phone somewhere in his apartment while his neighbor can also 
exploit the same spectral resources) or the network sum-rate (in the case of an operator). In the latter case, 
the operator possesses some degrees of freedom to make the Nash equilibrium more efficient. In the next 
section, we will use implicitly the Stackelberg formulation to determine the best location for a relay in a 
given scenario. 

VII. Simulation results 

First, we show that in the PA game with ZDSAF, one can have three possible Nash equilibria. For a given 
typical scenario, which does not need to be given here but can be found in [2], Fig. [H represents the BRs 
of the two transmitters. We see that there are three intersection points and therefore three Nash equilibria. 
As explained in Sec. Hill the effectively observed NE in a one-shot game is not predictable without any 
additional assumptions but the Coumot tatonnement procedure converges towards a given NE which can be 
predicted from the sole knowledge of the starting point of the game, namely 9^ or O^. 

In the rest of this section we focus on the Stackelberg formulation where the strategy of the leader is 
respectively the relay amplification factor, position and power allocation between the cooperative signals. 
The system considered is composed of an IRC in parallel with an interference channel (IC) lfT2l . thus Q = 2. 
All the simulations provided are obtained by applying the Cournot tatonnement procedure. The simulation 
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setup is as follows. The source and destination nodes are located in fixed locations in the region [— L, L]^ of 
a plane, with L = 10 m, such that the relative distances between the nodes are: dn = 6.52m, di2 = 8.32m, 
d2i = 6.64m, (^22 = 6.73m. We assume a path loss model for the channel gains \gij\, \hij\. For the path 

loss model we take \hij\ = (^^^ ^ and \gij\ = (^^^ ^ for G {l,2,r}^ where rfo = 1 m is a 
reference distance. As in Part I, to avoid any divergence for the path loss in dij = we assume that the 
relay is not in the plane but at e = 0.5 m from it i.e., the relay location is given by the {x-jz, y-jz, z-jz) where 
z-jz is fixed and equals 0.5 m. We will also assume that A^^^^ = A^^-* = Ni, i G {1, 2}, Nr^^ will be denoted 
by Nr and also P^^ = Pr, Ai^^ = Aj., ai^^ = a^, a'r'^ = a^., v^^^ = v. 

Optimal relay amplification gain for ZDSAF. First we consider the ZDSAF relaying scheme assuming 
a fixed amplification gain Ur = Ar (Sec. IIII-B|) . We want to analyze the influence of the value of the 
amplification factor, Ar G [0,0^(1, 1)], on the achievable network sum-rate at the NE. This is what Fig. [2] 
shows for the following scenario: e = 0.5 m. Pi = 20 dBm, P2 = 23 dBm, Pj. = 22 dBm, A'^i = 10 dBm, 
A^2 = 9 dBm, Nr = 7 dBm, 7^^^ = 7^^^ = 2. We observe that the optimal value is A* = 0.05 and is not 
equal to the one saturating the relay power constraint 5^(1, 1) = 0.17. This result illustrates for the sum-rate 
what we have proved analytically in Part I for the individual rate of a given user. 

Optimal relay location for ZDSAF. Now, we consider the ZDSAF when the relay full power regime is 
assumed, = dr{Oi, 62) (Sec. lIII-Al) and study the relay location problem. Fig. [3(a)] represents the achievable 
network sum-rate as a function of the relay position {x-jz^yn) G [— L, L]^ for the scenario: Pi = 20 dBm, 
P2 = 17 dBm, Pr = 22 dBm, A^i = 10 dBm, A^2 = 9 dBm, A^^, = 7 dBm, 7(1' = 2.5 and 7(2) = 2. We 
observe that there are two local maximum that actually correspond to the points that maximize the individual 
achievable rates. Many simulation results have confirmed that, when the source nodes are sufficiently far 
away from each other, maximizing the individual rate of either user at the NE amounts to locating the relay 
on one of the the segment between Si and Vi. This interesting and quite generic observation can be explained 
as follows. For this purpose, consider Fig. |3(b)| which is a temperature image representing the values of 61 
and 02 for different relay positions in [— The region where {61,62) = (1,0) (resp. {61,62) = (0,1)) 
is the region around Si (resp. ^2). We see that the intersection between these regions corresponds to small 
area. This quite general observation shows that the selfish behavior of the users leads to self-regulating the 
interference in the network. Said otherwise the selfish behavior of each transmitter leads it to leave the relay 
to the other transmitter when it is too far from it. Thus, when one transmitter uses the relay, it is often alone 
and sees no interference. In these conditions, by considering the path loss effects it can be proved that the 
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optimal relay position is on the segment between the considered source and destination nodes. This also 
explains why the position that maximizes the network sum-rate is also on one of the segments from Si to 

Optimal relay power allocation at the relay for DF and EE For the DF protocol we fix the cooperation 
degrees ri = and r2 = 0. In Fig. HI we plot the achievable sum-rate at the equilibrium as a function of 
the relay power allocation policy is u E [0, 1] (with the convention = u^-^^) for the scenario: x-ji = m, 
2/7^ = m, Pi = 22 dBm, P2 = 17 dBm, Pr = 23 dBm, A^i = 7 dBm, N2 = 9 dBm, iV^ = dBm, 
7*^^) = 2.5 and 7^^^ = 2. We observe that, for both protocols, the optimal power allocation u* = 1, meaning 
that the relay allocates all its available power to the better receiver, Vi. In this case the relay is in very good 
conditions and can therefore reliably decode the source messages. This explains why DF outperforms EF 
which is in agreement with the observations we have made in Part I. We have observed that, in general, the 
network sum-rate is not concave w.r.t. u E [0, 1] and that the optimal power allocation lies on the borders 
u* E {0, 1} for both relaying protocols. From Fig. |4] we also see that the fair PA policy that is, z/ = | can 
lead to a relatively significant performance loss. 

VIII. Conclusion 

In this paper we have introduced and studied a channel model which is very useful to analyze scenarios 
where each wireless device operating in unlicensed bands is offered the opportunity to exploit additional 
resources to increase its own transmission rate. Here the additional resources consist of spectrum plus 
cooperation power. We have analyzed in detail the problem of decentralized PA for the three dominant 
classes of relaying protocols (AF, DF, EF) and shown the existence of an equilibrium for the network for 
all these protocols. There can be multiple equilibria as proved for the game with AF. Fortunately, we have 
proved that in the case of AF with fixed amplification gain, the Coumot tatonnement converges to an NE 
that be predicted from the sole knowledge of the starting point of the game. This useful analysis needs to be 
extended to the case of the DF and EF protocols. We have seen that our problem can also be formulated as a 
Stackelberg game where the relay is the game leader and optimizes its amplification gain when AF is used, 
its location, and its power allocation between the two cooperative signals when DF or EF is implemented. 
This analysis is useful when adding some relaying nodes in a pre-existing network in order to improve its 
performance. In particular it has shown how to optimally locate the relay in a simple scenario and how the 
selfish behavior of the users leads to self-regulating the interference in the network. Important issues like 
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the uniqueness issue of the NE for DF and EF (under appropriate sufficient conditions), the convergence 
issue towards an NE in an iterative observation-reaction procedure need to be addressed; the latter issue 
is fundamental for cognitive transmitters where realistic spectrum sensing algorithms have be to assumed. 
From a broader perspective much works remains to be done to analyze the general problem of competition 
in cooperative channels. 

Appendix A 
Proof of concavity of u^^ w.r.t. 9^ 

We have to prove that r['^^ is also concave w.r.t. 9^''^ We will further consider only user 1 and prove 
the concavity of R^^ ^ w.r.t. 9^^ (for user 2 the proof is identical). Let us denote $1(6'^'^^) = 1 + r][^\ The 

(g) (g) 92/?''^ '^'I'^^'l — ('^'l )^ 

second derivative of R\ w.r.t. 9\ can be written as , = ^ . In what follows we will 



show that <^l<^i - ($'1)2 < 0. For sake of clarity we denote by A = a = (3 = p, 



7 = 4iV^?P2, S = h^£hiiJ9i'^p2, e, = ^ Irr-r^, , £2 = I 'J!r , and we define the function 



\hM\ fl('j)-„ ,1 





I'hr 


2 

pi 







+ £2^S^V^^^- We can now write $i(6'S'^^) = f{x{9'f'^)) with 



£2X'^ [(7 + ox) + px'' + IJ 
The first derivative of $1 is given by ^'{9[''^) = ^x' and the second derivative of $1 is given by (^"{9^^^) = 
^(x')^ + ^x", where x' and x" are the first and second order derivatives of x{9[''^). 

Let us define the functions A/'(-) and Ai{-) that give the numerator and the denominator of a fraction, 
respectively. One can show that: 

Afif'ix)) = -2Xxe2{l + ax) [{ax^ + ef) ((7 + Sx)^ + f3x^ 
+1) + X (1 + ax) {ej - x^) {6 (7 + 6x) + f3x)] , 

and that Mi^x) ^ M{f'{x)) = elx"^ + 6x)^ + px"^ + l]\ We note M(x) = Ar{f'{x)), ^o{x) 4 
^^ifix)), Moix) = M{f{x)) and then define U2{x) ^ ^ and M2{x) = 
We obtain that 

M2 = -2Xe2[{l + 2ax){el + ax^) + 3ax^{l + ax)][{-f + Sx)'^ + Px'^ + 1] 

-2Xe2x{3 + 2ax){l + ax){el - x'^)[S{-f + Sx) + px] (12) 
-2Xx^e2{l + ax)\£l - x^){S^ + P) 



April 16, 2009 



DRAFT 



16 



and 



Considering all these definitions we further obtain ^ = "^^"^^2^^"^^ and also that: 



(13) 



Ml 



Mo '' _ M ( '\2 
Mo Ml ' 



_l_ Mo N2M1-N1M2 ( ' 
^ Mo ^ ' 



NoNi{Mix' ~M2{x' f') + {MoN2Mi~M'lMQ){x f 
^ MoM'i ■ 

To show that $'^'<l> - < it is sufficient to show that both Ai = Mix" - M2{x'Y and A2 

AfoJ^2Mi - Mo axe negative. 

Knowing that x' = j^x^ and x" = ^x^, one can easily verify that: 



■^x^ [5 + (5)x^ + 675x + 7^ + 1] [(7 + 5xf + px"^ + l] 



< 0. 



(14) 



Furthermore, observing that A/q > 0, M.q > 0, Aii > and that A/2 < we have A2 < 0. In conclusion 
^ f„\ „ < and thus r\'^^ is a concave function of 6\'^\ 

Appendix B 
Number of Nash equilibria in the game Q^^ 

Before discussing these situations in detail, let us first observe that the two functions Fi{9j) are decreasing 
w.r.t. 9^ and also FM) = ^, FM) = where 6° = ^. 

J ^ ' Cii ^ J ' J Cij 

1) If < and d2 < 0, then the BR are constants BRi(%) = and thus the NE is unique {6^^, 9^^) = 
(0,0), for all Cii > 0, cji > 0. 

2) If di < and ^2 > 0, then it can be checked that the NE is unique, for all cu > 0, cji > 0: 9^^ = 
and 

^ , if d2< C22, 

1 , otherwise. 

3) If di > and d2 < 0, then, similarly to the previous item, we have a unique NE, for all ca > 0, 



/iNE 
^2 



Cji >0: 9^^ = and 



0', 



NE 



di 



,if 

, otherwise. 



di < cii, 
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4) If > and ^2 > 0, we have to take into consideration the parameters ca > 0, Cji > 0. 

a) If > 1 and ^2(1) > 1, then we have di > C12 + Cn and ^2 > C21 + 022- In this case the 
BR are constants i.e., BRi(^j) = 1 and thus the NE is unique (6'f^,6'^^) = (1, 1). 

b) If Fi{l) > 1 and ^2(1) < 1, then we have di > Cu + Cn and ^2 < C21 + 022- Here also the NE 
is unique and = 1 and 



, if d2> C22, 



, otherwise. 

c) If -Fi(l) < 1 and ^2(1) > 1, then we have di < C12 + cn and d2 > C21 + 022- Here also the NE 
is unique and 0^^ = 1 and 



^ ,if di>cu, 



, otherwise. 

d) If -Fi(l) < 1 and ^2(1) < 1, then we'll have di < C12 + cn and d2 < C21 + 022- This case is the 
most demanding one and will be treated in detail separately. 
At this point an important observation is in order. The discussed scenarios, for which we have determined 
the unique NE, have a simple geometric interpretation. If the intersection point {01,0 2) ^^ch. that either 
^1* G M \ [0, 1] or ^* G M \ [0, 1] then the NE is unique and differs from this point ((^f^, 9^^) ^ (^*, 01)). 
The case 4.(d) corresponds to the case where the intersection point {O^.O^) G [0, 1]^ is an NE point. Now 
we are interested in finding whether this intersection point is the unique NE or there are more than one NE. 
If < (ii < cii + C12 and < (i2 < C22 + C21 we have the following situations: 
1) If C11C22 = C21C12, then the curves described by Oi = Fi{Oj) are parallel. 

a) If di = d2, then the curves are superposed. In this special case we have an infinity of NE that 
can be characterized by (^f ^, 6^^^) G T where: 

r = {(^1,^2) €[0,1^1^1 = ^1(^2''^)}. 

b) If di ^ d2, then the two lines are only parallel. In this case it can be checked that the NE is 
unique. In order to explicit the exact relation of the NE, one has to consider all scenarios in 
function of the sign of the following four relations -Fj(O) — 1 and 9° — 1, i E {1,2}. We will 
explicit only one of them. Let us assume that Fi(0) — 1 < and 9° < which means that 
di < min{ci2, ^^^^ and ^2 < min{c2i, C22}. Here we have two sub-cases: 
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, If ^ < ^ then the NE is characterized by 9^ 

Ci o Coo ' •' i 



NE 



and = ^. 



Cl2 



C22 



d2 



C22 



, If > ^ then the NE is characterized by = and = 0. 

C12 C22 •' ^ C12C21 ^ 

2) Consider C11C22 7^ C21C12. Here we have to consider all cases in function of the sign of the four 
relations -Fj(O) — 1 and 9° — 1, i E {1,2}. We will focus on only one of them. Let us assume that 
Fj(0) — 1 < and 9° < and thus di < min{ci2,Cii} and ^2 < min{c2i, 022}- Here we have four 
sub-cases: 

, If ^ < ik and ^ > ^, then the NE is unique: ^^^e = q* ^nd ^^e ^ q* 

C22 C12 Cli C21 ' Til Z 

, If ^ > ^ and < then there are three different NE: 

C22 C12 Cll C21 

(^r,^2'^^)G{(^r,^2*),(o,^),(^,o)}. 

. If = and ^ < ^, then there are only two different NE: ^^E) ^ 1(0^ )^ (ik^ 0)}. 

C22 cxx C21 C22 Cxi 

. If > and ^ = ^, then there are two NE: (9^^, 9^^) G |(^, 0), (0, ^)|. 

C22 C12 Cll C21 ' \ L 1 Z / LVcii ' ^' ^ ' C22 ' J 

In conclusion, the cases where there are multiple NE are: 1) either when the lines 9i = Fi{9j) are 
superposed and the game has an infinity of NE or 2) when the lines have a unique intersection point {9^, ^2) 
that lies inside [0, 1] x [0, 1]. In the latter cases there can be one, two or three different NE. 



Appendix C 
Proof of concavity of uf^ w.r.t. 9- 

We want to prove that for each user r[''^ is concave w.r.t. 9l^\ Consider w.l.o.g. the case of user 1. We 
follow a similar approach to the AF case, based on the second derivative of R^^^ given in Eq. After 



some manipulations we obtain the following relation: 
(for sake of clarity we have denoted h^^^ by hij): 



Ml - M2 with Mfc 



k E {1,2} where 



NMi 
NM2 



DMi 
DM2 



As A7 {h2i^el,^^ p^+^Mhl^pii^+N^"^' 



- 2 



A6A72(ft2i2e(^'P2+i.Wh2,P,f«)+Ar{ 



+ 2 



A43(i.(«)h.2,P,(«') 

Ae A7 (h2i24'' P2+^hl^P^''^+N[''^] ' ^ 



1 I Ae 
^ A4 ' 



(15) 
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and 



Ai 
A2 
A3 
A4 

A5 
As 
A7 
As 



hu hre^f^P^ + h2l h2r A> 

hM'^ P^ + hM^^ P2 + N^''\ 
jy{q) _^ A3A2-Af 



3(9) 



+ 
2/3(9) 



(9) 



{h2^e^^^ P2 + vmi.p^"^^ + iv;^)) + /i2.'^^'^^ P2 vW^.p^"^ + atJ"), 



/i2r dr, P, 



^{g) ^ A3A2-Af 



As/iu'^S'^ Pi + (/i2i'^? P2 + ^/^.^iP.^^^ + A^J^^) /^i.'^i^^ Pi 

/ll,2Pi A2 + Ag/lll'Pl - 2Ai/iii /li. Pi, 



Arfeii^e^^' Pi 



+ As/^ii'Pi + (/Z2i'^?^ P2 + z/(^)/i,\Pi^^ + A^l''^) /^i.'Pi. 



(16) 

First note that A3A2 - A^ = (^^ hj^Pr^"'' + AT^^^) A3 + A^M'''^ + {hnhr - h2ih2r)^''^ pA'^ P2 > and 
thus we have that all the terms A^ > 0, A; G {1, . . . , 8}. Also we can easily see from Eq. (fT5l) that M2 > 0, 
DMi > 0. Thus if we prove that A^Mi < the concavity of R^^ '^ will be guaranteed. In this purpose we 



miMi with 

DNAIi ^^"-^^ 



plug the expressions of A4, A5, A7, Ag into Eq. (1151) and obtain that A^Mi 

-2 PiV('^)/i2,F('') \^(h,r h2, - /ill h2rf P2 + h^' (^hl.Pr^'^ + N^'^ 



NNMi 



hir^ h2l^0^^ P2^ + hir"^' 



(9) 



+ h^.Ve'i'^ P2V 



+h2M'^P2{jy^^^K,Pr^'^ 



+ A^J'^)] [h2i^e^^ P2 + ^/i^iPj'^ + A^J'^^ + z/(^)/i2,P< 



(9) 



and 



DNMi 



ihr h2l - hu h2rf ^i'^ Pi ^^'^ P2 (/l2l'^? P2 + {^mi^P^'^ + A^l'^^; 



A^i 



(9)^ 



+ Ari^) /l2^Pj^)/l2l'^?^ P2 + A^i'^ P^'^^hl^P^'^(^)hl^P^'^ 
+ h^M^'^ Pi /l2l'^5''^ P2 (^/l^iPj"^ + A^l''^) + /ll.'^i''^ Pi i^hl.Pr^'^ + N['^) 
+ h2M2''''P2'(^h^rlPr'^ + N^"'^) /I21' + /l2.'^^^ P2 (^hj^Pr^'^ + A^J 



{9)n 



+A^i^^ /iii^^i'^) Pi /i2i'^^'^^ P2 + N^" hu''e'^> Pi {u('')hl,P}''> + Nl"') + N^"' h2i%'>~P2 



M) 2/){9) 



(9) 



^(9)^ 



M) ^_ 4zi(9)^ 



+2 A^.^^) /i2i'^5'^ P2 (z^W^^Pr^^^ + A^f + N!^"^ {iyMhl,P:^'> + A^f ^ 

+ /i2r'^^'^ P2 (^K^P^"^ + Ar{^)) U^^^hl^P^'^ 

Therefore we obtain the desired result NMi < and thus Mi > 0, which implies that 
the channel parameters. 



,(9) 



(9)n 



r(9)^ 



,(9) 



< 0, whatever 
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Best Response Functions (p^=1 , p^=3, p=2) 




Ql , , , , g 

0.2 0.4 0.6 0.8 1 

61 



Fig. 1. Best replies for a system composed of an IC in band (1) and IRC in band (2) when the ZDSAF protocol is assumed (fixed amplification 
factor). The number of equilibria is generally three as indicated the figure. 



Achievable sum-rate 
2.67 1 , , , , , 




Fig. 2. ZDSAF relaying protocol with fixed amplification gain. Achievable network sum-rate at the NE as a function of Ar € [0, Ur] for 
L = 10m, e = 0.5m, Pi = 20dBm, P2 = 23dBm, Pr = 22dBm, A^i = lOdBm, iV2 = 9dBm, Nr = 7dBm, 7'^' = 7'^' = 2. The optimal 
amplification gain A* = 0.05 < 5^(1, 1) = 0.17 meaning that saturating the relay power constraint is suboptimal. 
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Fig. 3. ZDSAF relaying protocol, full power regime. L = 10m, e = Im, Pi = 20dBm, P2 = 17dBm, P,. = 22dBm, iVi = lOdBm, 
N2 = 9dBm, Nr = 7dBm, 7*^' = 2.5 and 7'^' = 2. (a) Achievable network sum-rate at the NE as a function of {xn^yiz) £ [-L,L]^ (the 
optimal relay position {x^,y^) = (—1.2, 1.7) lies on the segment between 5i and Vi). (b) Power allocation policies at the NE {6^^ ,9^^) 
as a function of {xizjyn) £ l^L, L]^ (the regions where the uses allocate their power to IRC are almost non overlapping.). 
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Fig. 4. EF vs. DF relaying protocol. Achievable network sum-rate at the NE as a function of v £ [0, 1] for L — 10m, e = Im, Pi = 22dBm, 
Pa = 17dBm, Pr = 23dBm, Ni = 7dBm, TVa = 9dBm, = OdBm, 7(1) = 2.5 and 7(2) = 2. The optimal relay PA i^* = 1 is in favor 
of the better user and outperforms the uniform relay PA 1/ = 0.5 for both EF and DF. 
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